Abstract
Background: Digital twin (DT) systems have emerged as a promising approach in health care, enabling real-time, patient-specific virtual modeling and personalized interventions. In diabetes care, DTs offer the potential to revolutionize glucose management, decision support, and therapy personalization through integration of real-time and longitudinal patient data.
Objective: This scoping review mapped the current landscape of DT applications in diabetes and synthesized evidence across 13 research questions organized into 7 thematic domains: system design, target conditions, data sources, personalization strategies, intelligence and adaptability, validation methods, and implementation considerations.
Methods: This scoping review was conducted in accordance with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) and JBI methodological guidance for scoping reviews. A literature search was performed in PubMed, IEEE Xplore, Scopus, and Web of Science for studies published up to April 2025; all databases were last searched on June 23, 2025. Eligible studies were original empirical articles in English that described patient-specific DT systems or closely related individualized virtual models applied to diabetes diagnosis, monitoring, management, treatment, or complication-related care. Reviews, editorials, commentaries, theoretical papers without original data, and studies not focused on diabetes were excluded. FSR, MJ, and KK independently screened records and assessed full texts, with disagreements resolved through discussion and, when needed, by EB. Data were charted using a structured framework based on 13 predefined research questions and were synthesized descriptively and thematically.
Results: Of 208 records identified, 123 underwent title and abstract screening, 39 full texts were assessed for eligibility, and 28 studies were included. Most studies focused on type 1 or type 2 diabetes and used data-driven, hybrid, or simulation-based DT approaches. Common clinical applications included therapeutic control, glucose prediction, decision support, and disease management. Lifestyle data, wearables, continuous glucose monitoring, and electronic health records were the dominant inputs, while personalization relied on adaptive feedback, insulin optimization, and behavior-driven tools. Intelligent features, such as adaptive learning, explainable artificial intelligence, and real-time synchronization, enhanced adaptability, although human oversight was rare. Validation was mainly retrospective or simulation-based, with few clinical trials; reported outcomes included improved hemoglobin A1c, time-in-range, and reduced hypoglycemia. Ethical discussions focused on data privacy, while implementation barriers centered on validation gaps, data quality, and workflow integration.
Conclusions: DT research in diabetes is expanding and shows strong potential for personalized and data-driven care; however, the evidence base remains heterogeneous, inconsistently reported, and limited in prospective clinical validation. Key gaps include standardized definitions, robust real-world evaluation, fairness and governance considerations, and integration into clinical workflows. Future work should prioritize clinically grounded validation, regulatory readiness, and interoperable architectures to support safe, equitable, and scalable implementation.
doi:10.2196/83059
Introduction
A digital twin (DT) is a dynamic, virtual representation of a physical system—such as a patient—that is continuously updated with real-world data and computational models to support prediction, simulation, and decision-making []. In health care, DTs are a powerful tool for personalized medicine, providing real-time, data-driven insights tailored to individual patients [,].
Diabetes mellitus, encompassing both type 1 and type 2 diabetes, remains a major chronic health condition requiring highly individualized care [,]. The complexity of diabetes management—driven by variability in disease trajectories, treatment responses, and complication risks—requires approaches that move beyond traditional one-size-fits-all models. DTs address this need by simulating glycemic dynamics, forecasting outcomes, and supporting therapy optimization on a patient-specific basis [,,,]. These models integrate diverse data sources, such as continuous glucose monitoring (CGM), insulin dosing records, electronic health records (EHRs), wearable sensors, genomic information, and lifestyle factors [,,].
Recent research highlights the potential of DTs in diabetes for applications such as predicting disease progression, personalizing nutrition, enhancing automated insulin delivery systems, and supporting self-management [,-]. For instance, DT frameworks that combine machine learning, multimodal data, and mechanistic modeling have been used to predict glycemic and complication-related outcomes in diabetes [,,,,]. Early-phase clinical and real-world studies suggest potential improvements in glycemic control, reduced medication use, and enhanced metabolic outcomes with DT-based interventions [,,-].
However, several barriers still hinder broader adoption and clinical integration. Key challenges include data integration and model personalization [,,], limited interoperability across devices and systems [,,], the absence of standardized validation and regulatory pathways [,,], and unresolved concerns around data privacy and ethical use [,].
Despite promising progress, DT research in diabetes remains fragmented and undervalidated. While some reviews have examined digital health tools in diabetes or explored DTs in general health care contexts [], no previous review has systematically synthesized DT applications in diabetes across key dimensions such as system design, personalization, data integration, validation, and implementation. This gap limits the ability of researchers, clinicians, and developers to assess maturity levels, identify best practices, and guide future development.
To address this gap, we conducted a scoping review guided by 13 research questions (RQs), grouped under 7 thematic domains to improve clarity and synthesis:
- System design and modeling foundations:
- RQ1: What types of DT models have been developed for diabetes care and management?
- RQ2: What system components are included in these models?
- RQ3: What modeling approaches are used in these systems?
- Target conditions and use context:
- RQ4: What types of diabetes are addressed by these DT applications?
- RQ5: What clinical goals do these DTs aim to support?
- Data sources and personalization mechanisms:
- RQ6: What data sources are used to build or update DTs for diabetes?
- RQ7: How are DTs used to enable personalized care or self-management in diabetes?
- Intelligence and adaptability:
- RQ8: How do the DTs handle uncertainty, real-time data updates, and model interpretability?
- Evaluation and validation:
- RQ9: What outcomes have been reported from applying DTs in diabetes care?
- RQ10: What methods have been used to validate these DT systems?
- Implementation and governance:
- RQ11: What ethical or legal issues are raised regarding the use of DTs in diabetes care?
- RQ12: What barriers and enablers are reported for implementing DT systems in clinical practice?
- Research and development gaps:
- RQ13: What gaps in knowledge or practice are identified in the literature on DTs in diabetes?
By systematically synthesizing evidence across these domains, this review provides a comprehensive overview of the current state of DT research in diabetes. The findings aim to inform researchers, clinicians, and technology developers about prevailing trends, methodological practices, and future opportunities for advancing personalized diabetes care through DT technologies [,,].
presents a synthesized architecture of DT systems in diabetes based on the common components identified across the included studies.

Methods
Overview
This scoping review was conducted in accordance with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) and JBI methodological guidance for scoping reviews [,].
Information Sources and Search Strategy
A comprehensive literature search was conducted through PubMed, IEEE Xplore, Scopus, and Web of Science. Studies published up to April 2025 were considered, and all databases were last searched on June 23, 2025. The search strategy combined terms related to “digital twin,” “diabetes,” and “healthcare” using Boolean operators. The detailed search strategy is provided in the . Reference lists of included studies and relevant reviews were also manually screened to identify additional records.
The search strategies developed for PubMed, Web of Science, IEEE Xplore, and Scopus were imported into the Triple-A (Article Analysis Assistant) software []. The tool was used to integrate bibliographic metadata, automatically remove duplicate records based on DOI, and perform additional deduplication using title, publication year, and author names. Reviewer decisions were subsequently imported into the platform, and the finalized dataset was prepared for downstream analysis and thematic synthesis.
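The two-step deduplication described above (first by DOI, then by title, publication year, and author names) can be sketched as follows. This is a minimal illustration only and not the Triple-A implementation: the record fields (`doi`, `title`, `year`, `first_author`), the title normalization, and the function names are assumptions introduced for this example.

```python
import re


def normalize_title(title: str) -> str:
    """Lowercase a title and collapse punctuation and whitespace (assumed normalization)."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def deduplicate(records: list[dict]) -> list[dict]:
    """Keep the first record per DOI, then per (title, year, first author) key."""
    seen_dois: set[str] = set()
    seen_keys: set[tuple] = set()
    unique: list[dict] = []
    for rec in records:
        doi = (rec.get("doi") or "").strip().lower()
        key = (
            normalize_title(rec.get("title", "")),
            rec.get("year"),
            (rec.get("first_author") or "").strip().lower(),
        )
        if doi and doi in seen_dois:
            continue  # duplicate by DOI
        if key in seen_keys:
            continue  # duplicate by bibliographic metadata
        if doi:
            seen_dois.add(doi)
        seen_keys.add(key)
        unique.append(rec)
    return unique


# Hypothetical records for illustration; these are not from the review's dataset.
example_records = [
    {"doi": "10.1000/example.1", "title": "A Digital Twin for Glucose Control",
     "year": 2024, "first_author": "Doe"},
    {"doi": "10.1000/EXAMPLE.1", "title": "A digital twin for glucose control.",
     "year": 2024, "first_author": "Doe"},
    {"doi": None, "title": "A Digital Twin for Glucose Control",
     "year": 2024, "first_author": "doe"},
    {"doi": "10.1000/example.2", "title": "Another Study",
     "year": 2023, "first_author": "Roe"},
]
deduplicated = deduplicate(example_records)
```

In this sketch, the second record is dropped because its DOI matches the first after case folding, and the third is dropped because it has no DOI but shares the same normalized title, year, and first author.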
Eligibility Criteria
Eligibility criteria were established a priori to ensure consistency and reproducibility during screening.
Inclusion Criteria
Studies were included if they were original empirical research articles, including peer-reviewed journal papers, conference proceedings, or preprints. Studies were eligible if they reported on the development, validation, implementation, or clinical evaluation of DT systems for diabetes, including type 1, type 2, gestational, or related complications. Research involving patient-specific modeling, simulation, or data-driven approaches relevant to diabetes diagnosis, management, or treatment was included. Articles addressing applications in personalized or precision medicine, clinical decision support, or individualized therapy for diabetes were also included. Publications were required to be written in English, with a structured abstract and an accessible full text.
Exclusion Criteria
Studies were excluded if they were review articles, meta-analyses, editorials, commentaries, book chapters, or theoretical or conceptual papers without original data. Studies focused on DTs for diseases or systems other than diabetes, such as cardiovascular, neurological, or orthopedic applications, were excluded. Articles lacking an abstract or full text, or published in languages other than English, were also excluded.
These criteria were set before the screening process to maintain consistency and transparency in study selection. During screening, FSR, MJ, and KK independently assessed each record for eligibility using the predefined criteria. Discrepancies or uncertainties were resolved through discussion, with EB consulted when necessary.
Study Selection
The selection process involved 3 stages—identification, screening, and eligibility assessment. We initially identified 208 studies from 4 major databases—PubMed (n=47), IEEE Xplore (n=1), Scopus (n=107), and Web of Science (n=53). During identification, 85 articles were excluded due to duplication, lack of an abstract, absence of original data, or being published in a language other than English.
Following this step, 123 articles proceeded to screening. At the screening stage, 84 articles were excluded according to the predetermined exclusion criteria. As a result, 39 articles advanced to eligibility assessment, and 28 were included in the final review [,-,-]. The 11 full-text articles excluded at the eligibility stage and the reasons for exclusion are listed in .
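The record counts reported for each selection stage can be checked with simple arithmetic. The short sketch below uses only the numbers stated above; the variable names are chosen for this example.

```python
# Records identified per database, as reported in the text.
identified_by_database = {
    "PubMed": 47,
    "IEEE Xplore": 1,
    "Scopus": 107,
    "Web of Science": 53,
}
excluded_at_identification = 85
excluded_at_screening = 84
excluded_at_eligibility = 11

identified = sum(identified_by_database.values())
screened = identified - excluded_at_identification
full_text_assessed = screened - excluded_at_screening
included = full_text_assessed - excluded_at_eligibility

print(identified, screened, full_text_assessed, included)  # prints: 208 123 39 28
```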
Screening was conducted in 2 stages:
- Title and abstract screening: FSR, MJ, and KK independently assessed each record against the predefined eligibility criteria.
- Full-text screening: Articles passing the first stage were retrieved in full and assessed for final inclusion.
To ensure consistent inclusion decisions, the screening questions in the following table were applied, reflecting the key characteristics of DTs and their application in diabetes care. For the purposes of this review, a study was considered to describe a DT if it included a patient-specific virtual representation or individualized computational model linked to diabetes-related data and intended for prediction, simulation, monitoring, or decision support. Studies using terms such as “virtual patient” or “simulation model” were included only if these DT-defining characteristics were present. Generic population-level models without individualized representation or diabetes-specific application were excluded.
| Screening question | Decision criteria |
| FQ1: Does the study discuss or apply DT technology? | Include only if the study explicitly referred to a DT or described a patient-specific virtual representation or individualized computational model linked to diabetes-related data and intended for prediction, simulation, monitoring, or decision support. |
| FQ2: Is the study focused on diabetes or diabetes-related conditions? | Include only if the main population or application domain involves diabetes (type 1, type 2, and gestational) or closely related metabolic conditions (eg, diabetic nephropathy and retinopathy). |
| FQ3: Is the DT model tailored to individual patients or based on patient-specific data? | Include only if the DT system is personalized using real or simulated patient-specific data (eg, glucose levels, insulin history, CGM, and EHRs). Exclude if the system is generic or population-level only. |
aFQ: filtering question.
bDT: digital twin.
cCGM: continuous glucose monitoring.
dEHR: electronic health record.
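The decision logic implied by the three filtering questions can be expressed as a simple conjunction. The sketch below assumes that a record is retained only when all three questions are answered affirmatively, which is how the "include only if" wording reads; the class and field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ScreeningAnswers:
    """Reviewer answers to the three filtering questions (field names are illustrative)."""
    fq1_describes_dt: bool       # explicit DT or individualized computational model
    fq2_diabetes_focus: bool     # diabetes or closely related metabolic condition
    fq3_patient_specific: bool   # personalized with real or simulated patient data


def include_study(answers: ScreeningAnswers) -> bool:
    """Assumed rule: every 'include only if' criterion must hold."""
    return (
        answers.fq1_describes_dt
        and answers.fq2_diabetes_focus
        and answers.fq3_patient_specific
    )


# A generic population-level glucose model fails FQ3 and is excluded.
population_model = ScreeningAnswers(True, True, False)
patient_specific_twin = ScreeningAnswers(True, True, True)
```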
Data Extraction and Thematic Framework
Data from the 28 included studies were charted using a structured framework guided by the 13 predefined research questions presented in the Introduction. These research questions were organized into 7 thematic domains to facilitate systematic synthesis: (1) system design and modeling foundations (RQ1, RQ2, and RQ3), (2) target conditions and use context (RQ4 and RQ5), (3) data sources and personalization mechanisms (RQ6 and RQ7), (4) intelligence and adaptability (RQ8), (5) evaluation and validation (RQ9 and RQ10), (6) implementation and governance (RQ11 and RQ12), and (7) research and development gaps (RQ13).
Each included study was analyzed systematically using this framework. Categories were not mutually exclusive, and individual studies could be charted under more than 1 category where appropriate. The full study characteristics and data charting table are provided in .
Consistent with scoping review methodology, formal risk-of-bias, reporting bias, and certainty-of-evidence assessments were not performed because the aim was to map the breadth, characteristics, and gaps in a heterogeneous body of literature rather than to compare intervention effects or generate pooled estimates.
A review protocol and project materials for this scoping review were made available through the Open Science Framework (OSF) [].
Results
Overview
Across the 28 included studies (as shown in the PRISMA [Preferred Reporting Items for Systematic Reviews and Meta-Analyses] flow diagram in ) [,-,-], DT systems for diabetes exhibited diverse architectures, data sources, and application goals. Most models were data-driven or hybrid (artificial intelligence [AI]+mechanistic), while purely mechanistic and conceptual designs were less common. Core system components included machine learning (ML) or AI modules, decision support layers, and real-time simulation engines. The majority of DTs leveraged CGM, wearables, and lifestyle data, with increasing use of patient-specific models to enable personalized therapy, behavioral nudges, and simulation-based feedback.

ML was the dominant modeling approach, while reinforcement learning, control theory, and signal processing appeared less frequently. Strategies for uncertainty management and interpretability were adopted inconsistently, with adaptive learning and explainable AI used in some studies, but with limited human-in-the-loop oversight. Reported outcomes most often focused on glycemic control (eg, hemoglobin A1c [HbA1c], time-in-range [TIR], and reduced hypoglycemia), alongside improvements in predictive accuracy, metabolic markers, and patient engagement. However, external clinical validation remained scarce, with most evaluations based on retrospective datasets or simulations.
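For readers less familiar with the time-in-range outcome mentioned above, the following sketch shows how TIR is typically computed from CGM readings. The 70 to 180 mg/dL target range is a commonly used consensus range and is an assumption here, since the review does not state the thresholds used by the included studies; the function name and sample readings are illustrative.

```python
def time_in_range(readings_mg_dl: list[float], low: float = 70.0, high: float = 180.0) -> float:
    """Return the percentage of CGM readings within [low, high] mg/dL.

    The default bounds are an assumed consensus target range, not values
    taken from the reviewed studies.
    """
    if not readings_mg_dl:
        raise ValueError("at least one CGM reading is required")
    in_range = sum(low <= value <= high for value in readings_mg_dl)
    return 100.0 * in_range / len(readings_mg_dl)


# Illustrative readings: one below range, three in range, one above range.
sample_tir = time_in_range([65, 80, 120, 180, 200])
```

With equally spaced readings, the share of readings in range approximates the share of time in range; unevenly sampled data would need time weighting.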
Ethical considerations—mainly privacy and transparency, with occasional references to accountability and bias—were inconsistently addressed. Implementation barriers included validation limitations, data quality issues, model limitations, and workflow misalignment. Finally, the literature highlights persistent research gaps in integration with real-world systems, scalability, and methodological rigor that must be addressed to advance DT systems into clinical use.
System Design and Modeling Foundations (RQ1, RQ2, RQ3)
Overview
This section describes how DT models in diabetes are structured and modeled. It summarizes the types of models used (RQ1), the core system components included (RQ2), and the computational modeling strategies adopted (RQ3). Together, these questions cover the architectural and technical foundations of DTs in diabetes.
Model Types (RQ1)
DT models in diabetes care fall into 4 main categories: data-driven, hybrid, mechanistic, and conceptual. Data-driven models, most commonly using ML or deep learning, were used in half of the studies and focused on prediction and classification tasks. Hybrid models, which combine physiological modeling with AI, support real-time control systems, such as automated insulin delivery. Mechanistic models appeared less frequently and were primarily used in simulation studies. Conceptual frameworks were rare and largely theoretical. The following table summarizes the types of DT models reported in diabetes care, with representative examples from included studies.
| Model type | Key characteristics | Studies, n | Representative examples |
| Data-driven | ML, DL, RL; CGM-based | 14 | Shamanna et al [], Shamanna et al [], Vaskovsky and Chvanova [], Shamanna et al [] |
| Hybrid | ML+mechanistic model | 10 | Sarani Rad et al [], Cappon et al [], Colmegna et al [], Ahmadasas et al [] |
| Mechanistic | ODEs, simulations | 5 | Young et al [], Thamotharan et al [], Wang et al [], Zavitsanou et al [] |
| Conceptual | Framework only | 1 | Mishra et al [] |
aML: machine learning.
bDL: deep learning.
cRL: reinforcement learning.
dCGM: continuous glucose monitoring.
eODE: ordinary differential equation.
Key findings included:
- Data-driven models (14 studies, 50%): Applied for HbA1c forecasting, glycemic risk scoring, and behavior modeling [,-,,,,,].
- Hybrid models (10 studies, 35.7%): Enabled adaptive insulin dosing and feedback control by integrating ML with mechanistic physiology [,-,,,,,].
- Mechanistic models (5 studies, 17.9%): Focused on ODE-based glucose-insulin dynamics for simulation and metabolic exploration [-].
- Conceptual frameworks (1 study, 3.6%): Proposed theoretical DT architecture without implementation [].
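The percentages above are computed against the 28 included studies rather than against the sum of category counts. The sketch below reproduces them from the reported counts and shows that the counts sum to more than 28, which is consistent with the charting rule that a study could be assigned to more than 1 category. Variable and function names are chosen for this example.

```python
TOTAL_INCLUDED = 28  # number of included studies reported in the review

# Study counts per model type, as reported in the table.
model_type_counts = {
    "Data-driven": 14,
    "Hybrid": 10,
    "Mechanistic": 5,
    "Conceptual": 1,
}


def share_of_studies(count: int, total: int = TOTAL_INCLUDED) -> float:
    """Percentage of included studies, rounded to 1 decimal place."""
    return round(100 * count / total, 1)


percentages = {name: share_of_studies(n) for name, n in model_type_counts.items()}
# Category assignments beyond the number of studies, explained by multi-category charting.
surplus_assignments = sum(model_type_counts.values()) - TOTAL_INCLUDED

print(percentages)
print(surplus_assignments)
```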
System Components (RQ2)
Most DT systems consisted of modular components supporting prediction, simulation, control, and user interaction. The most common modules were ML or AI components, followed by simulation engines and data integration layers. User-facing dashboards and decision support or control modules were also frequently described, while personalization layers, backend infrastructure, and rule-based systems were less common. The following table summarizes the system component categories reported in diabetes DT models, including their functions and representative examples.
| System component category | Key characteristics | Studies, n | Representative examples |
| ML/AI module | LSTM, CNN, reinforcement learning | 20 | Zhang et al [], Pellizzari et al [], Chen et al [], Joshi et al [] |
| Simulation engine | Glucose-insulin model, ReplayBG engine, ODE-based simulator | 17 | Wang et al [], Zavitsanou et al [], Mishra et al [], Leszczełowska et al [] |
| Data integration layer | CGM devices, IoT sensors, preprocessing layer | 12 | Vaskovsky and Chvanova [], Ahmadasas et al [], Villa-Tamayo et al [], Shamanna et al [] |
| User interface or dashboard | Mobile apps, web dashboards, patient interfaces | 11 | Colmegna et al [], Leszczełowska et al [], Shamanna et al [], Shamanna et al [] |
| Decision support or control feedback module | MPC, PID controller, feedback system | 10 | Shamanna et al [], Cappon et al [], Young et al [], Zhu et al [] |
| Intervention or recommendation engine | GPT-based module, precision nutrition, lifestyle recommendations | 9 | Sarani Rad et al [], Shamanna et al [], Young et al [], Shamanna et al [] |
| Personalization layer | Personalization engine, patient-specific tuning | 5 | Cappon et al [], Young et al [], Pellizzari et al [], Chen et al [] |
| Monitoring and alerts | Real-time alerts, patient monitoring, CGM-based tracking | 4 | Shamanna et al [], Shamanna et al [], Shamanna et al [], Vaskovsky et al [] |
| Backend or platform infrastructure | Cloud platform, database engine, analytics engine | 3 | Vaskovsky et al [], Chahal et al [], Cappon et al [] |
| Knowledge representation or semantic layer | Knowledge graphs, ontologies | 2 | Sarani Rad et al [], Zhang et al [] |
| Rule-based decision system | Expert system, rule tables | 2 | Shamanna et al [], Zhu et al [] |
aML: machine learning.
bAI: artificial intelligence.
cLSTM: long short-term memory.
dCNN: convolutional neural network.
eODE: ordinary differential equation.
fCGM: continuous glucose monitoring.
gIoT: internet of things.
hMPC: model predictive control.
iPID: proportional-integral-derivative.
jGPT: generative pre-trained transformer.
Key findings included:
- ML or AI modules (20, 71.4% studies) were central to prediction, therapy optimization, and personalization [-,-,,,-].
- Simulation engines (17, 60.7% studies) provided physiological modeling and glucose-insulin dynamics for testing and validation [,,,,,,,-,,].
- Data integration layers (12, 42.9% studies) supported real-time data collection from CGM, internet-of-things sensors, and preprocessing pipelines [,,,,,,,,,-].
- User interfaces (11, 39.3% studies) enabled interaction for patients and clinicians through mobile apps or dashboards [,,,-,,,,,].
Modeling Approaches (RQ3)
The computational strategies used in diabetes DT systems reflect both the predictive and control needs of these models. While ML was the dominant method, several studies incorporated reinforcement learning, control theory, and signal processing for adaptive and real-time decision-making. The following table summarizes the range of modeling techniques reported across included studies, with representative examples.
| Modeling approach category | Key characteristics | Studies, n | Representative examples |
| Machine learning | Random forest, LSTM, CNN, gradient boosting | 20 | Vaskovsky and Chvanova [], Shamanna et al [], Colmegna et al [], Batagov et al [] |
| Statistical or probabilistic methods | Logistic regression, Bayesian inference, survival analysis | 9 | Shamanna et al [], Leszczełowska et al [], Zhu et al [], Vaskovsky et al [] |
| Physiological modeling | ODEs, mechanistic models, compartmental models | 8 | Cappon et al [], Ahmadasas et al [], Young et al [], Wang et al [] |
| Control theory | MPC, optimal control, PID controllers | 3 | Ahmadasas et al [], Wang et al [], Zavitsanou et al [] |
| Reinforcement learning | DQN, Soft Actor–Critic | 2 | Sarani Rad et al [], Chen et al [] |
| Control, estimation, or signal processing | Kalman filtering, signal estimation, signal processing algorithms | 3 | Ahmadasas et al [], Zavitsanou et al [], Vaskovsky et al [] |
| Optimization or model calibration | Parameter estimation, parameter fitting algorithms | 2 | Ahmadasas et al [], Thamotharan et al [] |
| Rule-based systems | Dynamic risk thresholding, equation-based bolus calculation, rule-based reasoning | 2 | Pellizzari et al [], Zhu et al [] |
| Simulation-based modeling | Simulation training, Euler’s method | 2 | Mishra et al [], Chen et al [] |
| Natural language processing | GPT-based natural language generation | 1 | Cappon et al [] |
| System dynamics | Causal loop diagrams, feedback modeling | 1 | Mishra et al [] |
aLSTM: long short-term memory.
bCNN: convolutional neural network.
cODE: ordinary differential equation.
dMPC: model predictive control.
ePID: proportional-integral-derivative.
fDQN: deep Q-network.
gGPT: generative pre-trained transformer.
Key findings included:
- ML was the most common approach (20, 71.4% studies), used for glucose prediction, patient modeling, and feature extraction [,-,-,,,,-].
- Statistical and probabilistic methods appeared in 9 (32.1%) studies, often applied to regression, inference, or survival analysis [,,,,,,,,].
- Physiological modeling was reported in 8 (28.6%) studies, leveraging ordinary differential equations, compartmental models, and mechanistic representations [,,-,,,].
- Control-based approaches were less frequent, with control theory (3, 10.7% studies) [,,] and reinforcement learning (2, 7.1% studies) supporting adaptive insulin delivery and personalization [,].
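To make the physiological modeling category more concrete, the sketch below integrates a two-state glucose-insulin model with Euler's method. It uses the classic Bergman minimal model as a stand-in; it is not the model of any included study, and all parameter values (`p1`, `p2`, `p3`, basal glucose `gb`, basal insulin `ib`) are illustrative assumptions rather than fitted patient parameters.

```python
from typing import Callable


def simulate_minimal_model(
    g0: float,
    insulin: Callable[[float], float],
    minutes: float,
    dt: float = 1.0,
    gb: float = 90.0,   # basal glucose, mg/dL (assumed)
    ib: float = 10.0,   # basal insulin, microU/mL (assumed)
    p1: float = 0.03,   # glucose effectiveness, 1/min (assumed)
    p2: float = 0.02,   # decay of insulin action, 1/min (assumed)
    p3: float = 1e-5,   # insulin sensitivity gain (assumed)
) -> list[float]:
    """Euler integration of dG/dt = -(p1 + X)G + p1*Gb and dX/dt = -p2*X + p3*(I - Ib)."""
    steps = int(minutes / dt)
    g, x = g0, 0.0
    trace = [g]
    for k in range(steps):
        t = k * dt
        dg = -(p1 + x) * g + p1 * gb
        dx = -p2 * x + p3 * (insulin(t) - ib)
        g += dt * dg
        x += dt * dx
        trace.append(g)
    return trace


# Scenario 1: insulin held at basal, so glucose relaxes toward basal glucose.
basal_trace = simulate_minimal_model(180.0, lambda t: 10.0, minutes=300)
# Scenario 2: insulin held above basal, so glucose settles at a lower level.
elevated_trace = simulate_minimal_model(180.0, lambda t: 30.0, minutes=300)
```

In a DT setting, personalization would amount to estimating such parameters from an individual's CGM and insulin data, which corresponds to the optimization or model calibration category in the table.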
Target Conditions and Use Context (RQ4, RQ5)
Overview
This section summarizes the specific types of diabetes addressed in DT studies (RQ4) and the clinical goals these models aim to support (RQ5). Together, these questions provide insight into intended use cases and patient populations for DT applications in diabetes care.
Target Conditions (RQ4)
DT studies in diabetes addressed multiple forms of the disease, with some models applicable to more than 1 type. Most studies focused on type 1 diabetes (T1D) or type 2 diabetes (T2D), whereas fewer studies targeted gestational diabetes or diabetes-related complications. The following table summarizes the targeted diabetes types and disease stages addressed across the included studies, with representative examples.
| Diabetes type | Studies, n | Representative examples |
| Type 2 diabetes | 14 | Colmegna et al [], Mishra et al [], Villa-Tamayo et al [], Shamanna et al [] |
| Type 1 diabetes | 13 | Thamotharan et al [], Wang et al [], Zavitsanou et al [], Batagov et al [] |
| Diabetic retinopathy (secondary to diabetes) | 1 | Chahal et al [] |
| Gestational diabetes | 1 | Leszczełowska et al [] |
Key findings included:
- T2D was the most frequent target (14, 50% studies), with models supporting therapy optimization, metabolic simulation, and lifestyle interventions [,-,-,,,,].
- T1D was addressed in 13 (46.4%) studies, primarily through closed-loop systems, real-time insulin delivery, and glucose control simulations [,-,,-,,,,].
- Diabetic complications were rarely considered, with 1 (3.6%) study focused on diabetic retinopathy [].
- Gestational diabetes was examined in 1 (3.6%) study, reflecting limited application to pregnancy-related diabetes [].
Clinical Goals (RQ5)
DT applications in diabetes addressed a broad range of clinical objectives, spanning real-time monitoring, safety, decision support, and long-term disease management. These goals were classified into primary categories reflecting their roles in clinical care. The following table summarizes the clinical goals of diabetes DTs, including their functions and representative examples.
| Clinical goal category | Key characteristics | Studies, n | Representative examples |
| Therapeutic control or intervention | Insulin dosing, glycemic variability management, closed-loop control | 17 | Colmegna et al [], Wang et al [], Zavitsanou et al [], Shamanna et al [] |
| Decision support or treatment planning | Dietary recommendation, therapy optimization, clinician support | 10 | Colmegna et al [], Thamotharan et al [], Mishra et al [], Cappon et al [] |
| Safety or alerting system | Hypoglycemia alerts, early glycemic warnings, safety enhancement | 10 | Young et al [], Pellizzari et al [], Chen et al [], Joshi et al [] |
| Disease prediction or forecasting | Glucose forecasting, disease progression prediction, GDM risk | 9 | Sarani Rad et al [], Shamanna et al [], Zhang et al [], Joshi et al [] |
| Disease management or remission | HbA1c reduction, weight loss, medication reduction | 8 | Shamanna et al [], Shamanna et al [], Shamanna et al [], Surian et al [] |
| Monitoring or control | Glucose time-in-range, health monitoring, normoalbuminuric | 6 | Sarani Rad et al [], Young et al [], Mishra et al [], Shamanna et al [] |
| Diagnosis or screening | DR detection, GDM diagnosis, complication screening | 4 | Zhang et al [], Leszczełowska et al [], Vaskovsky et al [], Chahal et al [] |
| Risk assessment | Maternal risk, risk stratification | 3 | Leszczełowska et al [], Villa-Tamayo et al [], Vaskovsky et al [] |
aGDM: gestational diabetes mellitus.
bHbA1c: hemoglobin A1c.
cDR: diabetic retinopathy.
Key findings included:
- Therapeutic control or intervention was the most common application (17, 60.7% studies), including insulin dosing, closed-loop control, and management of glycemic variability [,,,-,,-,,,,].
- Decision support and treatment planning were reported in 10 (35.7%) studies, covering dietary recommendations, therapy optimization, and clinician-facing guidance [,-,-,,,].
- Safety and alerting systems also appeared in 10 (35.7%) studies, emphasizing hypoglycemia warnings and proactive risk alerts [-,-,,,].
- Disease prediction or forecasting was described in 9 (32.1%) studies, targeting HbA1c trajectories, disease progression, and gestational diabetes risk [,-,,,,,].
In the clinical goals table above, “therapeutic control or intervention” refers to systems that actively optimize or recommend treatment actions, such as insulin dosing or therapy adjustment; “monitoring or control” refers to systems focused on tracking glycemic status or physiological trends; and “decision support or treatment planning” refers to systems that inform clinician or patient decision-making without necessarily acting as real-time controllers.
Data Sources and Personalization Mechanisms (RQ6, RQ7)
Overview
This section summarizes the types of data used to construct or update DTs for diabetes (RQ6) and the mechanisms through which these models enable personalization or self-management (RQ7). These aspects reflect both the technical input and patient-centered application of DT systems.
Data Sources (RQ6)
DT models drew on a wide range of data sources to ensure an accurate representation of patient state and dynamics. These included lifestyle, sensor-derived, clinical, and synthetic datasets, with varying degrees of adoption across studies. The following table summarizes the data sources used in diabetes DT systems, with representative examples.
| Data category | Key characteristics | Studies, n | Representative examples |
| Lifestyle data | Physical activity, dietary intake, sleep patterns | 20 | Colmegna et al [], Zavitsanou et al [], Shamanna et al [], Shamanna et al [] |
| Wearable devices | Heart rate, insulin delivery data, blood pressure | 19 | Sarani Rad et al [], Chen et al [], Shamanna et al [], Shamanna et al [] |
| CGM | CGM data, blood glucose measurements, glucose monitors | 18 | Vaskovsky and Chvanova [], Colmegna et al [], Shamanna et al [], Shamanna et al [] |
| Electronic health records | Clinical history, laboratory results, patient demographics | 12 | Shamanna et al [], Villa-Tamayo et al [], Shamanna et al [], Shamanna et al [] |
| Simulated and public datasets | PIMA dataset, UVa/Padova simulator, synthetic NHANES data | 6 | Wang et al [], Zavitsanou et al [], Mishra et al [], Chahal et al [] |
| Physiological parameters | Body weight, personal characteristics, physiological metrics | 5 | Ahmadasas et al [], Thamotharan et al [], Zavitsanou et al [], Pellizzari et al [] |
| Patient-reported outcomes | Mobile health logs, self-monitoring, patient-generated input | 3 | Sarani Rad et al [], Zhang et al [], Pellizzari et al [] |
| Genomic data | Metabolomics, proteomics | 1 | Zhang et al [] |
| Imaging data | Fundus images, Optos scans, Gaussian-filtered visuals | 1 | Chahal et al [] |
CGM: continuous glucose monitoring.
PIMA: Pima Indians Diabetes Dataset.
NHANES: National Health and Nutrition Examination Survey.
Key findings included:
- Lifestyle data were the most widely used input, appearing in 20 (71.4%) studies and covering physical activity, dietary intake, sleep, and behavioral logs [-,-,,,].
- Wearable devices were incorporated in 19 (67.9%) studies, capturing heart rate, insulin delivery, and blood pressure [,-,-,-,,,,,].
- CGM appeared in 18 (64.3%) studies, enabling real-time tracking, control feedback, and risk forecasting [,-,-,,,,].
- EHRs were used in 12 (42.9%) studies, providing longitudinal medical history, laboratory results, and medication data [,-,-,,].
- Synthetic and public datasets were used in 6 (21.4%) studies, often for simulation or benchmarking, such as the UVa/Padova simulator or National Health and Nutrition Examination Survey (NHANES) data [-,,,].
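The percentages in these findings follow from the study counts in the table, with the 28 included studies as the denominator. The minimal Python sketch below reproduces that arithmetic; the dictionary and helper names are illustrative and are not part of the review's charting framework.

```python
# Study counts per data category, copied from the data sources table.
TOTAL_STUDIES = 28  # number of studies included in the review

data_source_counts = {
    "Lifestyle data": 20,
    "Wearable devices": 19,
    "CGM": 18,
    "Electronic health records": 12,
    "Simulated and public datasets": 6,
}


def share_of_studies(count: int, total: int = TOTAL_STUDIES) -> float:
    """Return the percentage of included studies, rounded to 1 decimal place."""
    return round(100 * count / total, 1)


shares = {name: share_of_studies(n) for name, n in data_source_counts.items()}

for name, pct in shares.items():
    print(f"{name}: {data_source_counts[name]}/{TOTAL_STUDIES} = {pct}%")
```

Because a single study can use several data categories, the counts overlap and the percentages are not expected to sum to 100%.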
Personalization Mechanisms (RQ7)
Most DT systems aimed to enable personalized care through individualized feedback, adaptive modeling, or real-time decision support. Personalization strategies varied in scope, ranging from lifestyle guidance to therapy optimization and digital coaching. The table below summarizes the personalization features and tailoring strategies used in diabetes DT systems, with representative examples.
| Personalization mechanism category | Key characteristics | Studies, n | Representative studies |
| Personalized lifestyle recommendations | Nutrition guidance, individualized meal or activity plans, lifestyle support | 11 | Cappon et al [], Young et al [], Shamanna et al [], Shamanna et al [] |
| Real-time or adaptive personalization | Dynamic feedback, CGM-based tuning, adaptive intervention planning | 11 | Chen et al [], Leszczełowska et al [], Vaskovsky et al [], Chahal et al [] |
| Personalized insulin or therapy optimization | Personalized virtual patients, ReplayBG, health scenario simulation | 10 | Shamanna et al [], Ahmadasas et al [], Zhu et al [], Cappon et al [] |
| Self-management tools or patient interface | App feedback, color-coded food systems, personalized tracking tools | 8 | Sarani Rad et al [], Shamanna et al [], Shamanna et al [], Surian et al [] |
| Individualized simulation models | Personalized virtual patients, ReplayBG, health scenario simulation | 6 | Sarani Rad et al [], Zavitsanou et al [], Pellizzari et al [], Chen et al [] |
| Behavior-driven personalization | AI-guided nudges, digital coaching, human support | 4 | Shamanna et al [], Colmegna et al [], Shamanna et al [], Surian et al [] |
| Safety or alerting system | Tailored alerts, risk-specific notifications | 1 | Vaskovsky et al [] |
CGM: continuous glucose monitoring.
Key findings included:
- Personalized lifestyle recommendations were the most frequent approach, appearing in 11 (39.3%) studies and providing tailored nutrition, activity, and daily routine guidance [,,,-,,].
- Real-time or adaptive personalization was also reported in 11 (39.3%) studies, offering interventions dynamically responsive to CGM and sensor feedback [,,,,,,,,-].
- Personalized insulin or therapy optimization appeared in 10 (35.7%) studies, focusing on precision dosing, medication planning, and adaptive therapy [,,,,,-,,].
- Individualized simulation models were described in 6 (21.4%) studies, enabling patient-specific scenario testing and comparative evaluation [,,-,].
Intelligence and Adaptability (RQ8)
Overview
This section explores how DT systems in diabetes manage uncertainty, real-time data updates, and interpretability. These features are central to ensuring the trustworthiness, safety, and clinical relevance of DT models in dynamic health care settings.
Handling Uncertainty, Adaptation, and Interpretability (RQ8)
Based on the 28 included studies, 5 main categories of strategies were identified. The table below summarizes the strategies used for handling uncertainty, dynamic adaptation, and interpretability in diabetes DT systems.
| Strategy category | Key characteristics | Studies, n | Representative examples |
| Adaptive learning | Feedback loop tuning, model retraining, dynamic personalization | 18 | Shamanna et al [], Thamotharan et al [], Mishra et al [], Vaskovsky et al [] |
| Explainable AI | Feature importance, knowledge graphs, visual interpretability | 16 | Vaskovsky and Chvanova [], Colmegna et al [], Wang et al [], Chahal et al [] |
| Real-time synchronization | Real-time CGM updates, Kalman filtering, continuous data sync | 15 | Colmegna et al [], Ahmadasas et al [], Wang et al [], Chahal et al [] |
| Confidence scoring | Cross-validation, confidence intervals, robustness testing | 12 | Vaskovsky and Chvanova [], Zavitsanou et al [], Mishra et al [], Zhang et al [] |
| Human-in-the-loop | Physician monitoring, manual oversight, feedback mechanisms | 3 | Shamanna et al [], Mishra et al [], Shamanna et al [] |
AI: artificial intelligence.
CGM: continuous glucose monitoring.
Key findings included:
- Adaptive learning was the most common capability, appearing in 18 (64.3%) studies and enabling dynamic personalization through feedback loop tuning, model retraining, and continuous parameter updates [,-,-,,-,,,].
- Explainable AI appeared in 16 (57.1%) studies, using methods such as feature importance analysis, visual interpretability, and knowledge graphs to improve transparency [,,-,,,,,,,-].
- Real-time synchronization was reported in 15 (53.6%) studies, supporting continuous data integration from CGM and other sensors via Kalman filtering and real-time updates [,,-,,,,,,-].
- Confidence scoring approaches were applied in 12 (42.9%) studies, employing cross-validation, CIs, and robustness testing to quantify uncertainty [-,,,,,,,,,].
- Human-in-the-loop oversight was reported in 3 (10.7%) studies, providing physician monitoring or manual intervention in safety-critical contexts [,,].
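To make the real-time synchronization strategy concrete, the sketch below shows a one-dimensional Kalman filter that updates a glucose estimate each time a new CGM reading arrives. It is a minimal illustration and not the method of any included study: the random-walk state model, the class and function names, the noise variances, and the sample readings are all assumptions chosen for clarity.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ScalarKalmanFilter:
    """One-dimensional Kalman filter with a random-walk glucose model."""

    estimate: float         # current glucose estimate (mg/dL)
    variance: float         # uncertainty of the current estimate
    process_var: float      # assumed drift between readings (illustrative)
    measurement_var: float  # assumed sensor noise variance (illustrative)

    def update(self, reading: float) -> float:
        # Predict step: the state is assumed unchanged, so only uncertainty grows.
        prior_var = self.variance + self.process_var
        # Update step: the gain weighs the new reading against the prediction.
        gain = prior_var / (prior_var + self.measurement_var)
        self.estimate += gain * (reading - self.estimate)
        self.variance = (1.0 - gain) * prior_var
        return self.estimate


def track(readings: Sequence[float],
          process_var: float = 4.0,
          measurement_var: float = 100.0) -> List[float]:
    """Return one filtered estimate per reading, in arrival order."""
    if not readings:
        return []
    # Initialize from the first reading, with sensor-level uncertainty.
    kf = ScalarKalmanFilter(readings[0], measurement_var,
                            process_var, measurement_var)
    return [readings[0]] + [kf.update(r) for r in readings[1:]]


cgm = [110.0, 118.0, 109.0, 131.0, 127.0, 140.0]  # made-up readings, mg/dL
print([round(x, 1) for x in track(cgm)])
```

A larger `measurement_var` makes the estimate follow new readings more slowly, whereas a larger `process_var` makes it more responsive; a clinical system would need to tune and validate both against real sensor data.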
Evaluation and Validation (RQ9, RQ10)
Overview
This section summarizes reported outcomes from DT applications in diabetes (RQ9) and describes the methods used to validate these systems (RQ10). Together, these questions address the effectiveness and credibility of DT models in clinical and experimental contexts.
Reported Outcomes (RQ9)
Across the 28 included studies [,-,-], reported outcomes varied widely depending on the DT system’s clinical target and implementation maturity. Outcomes were grouped into major categories reflecting both clinical and system-level effects. The table below summarizes the clinical outcomes of DTs for diabetes.
| Outcome category | Key characteristics | Studies, n | Representative examples |
| Improved HbA1c or glycemic control | Increased time in range, HbA1c reduction, improved control | 17 | Cappon et al [], Thamotharan et al [], Wang et al [], Chen et al [] |
| Other clinical benefits | Retinopathy or nephropathy improvement, cardiovascular risk reduction | 11 | Shamanna et al [], Colmegna et al [], Leszczełowska et al [], Villa-Tamayo et al [] |
| Improved prediction accuracy | Accurate glucose or GDM prediction, low RMSE or MAE | 9 | Vaskovsky and Chvanova [], Zavitsanou et al [], Leszczełowska et al [], Chahal et al [] |
| Medication use reduction | Reduced or discontinued medication use | 6 | Shamanna et al [], Shamanna et al [], Shamanna et al [], Surian et al [] |
| Weight or metabolic outcomes | Weight loss, improved insulin resistance, BMI reduction | 5 | Shamanna et al [], Shamanna et al [], Shamanna et al [], Surian et al [] |
| Hypo- or hyperglycemia reduction | Fewer glycemic events, improved variability | 5 | Thamotharan et al [], Zavitsanou et al [], Chen et al [], Zhu et al [] |
| T2D remission or reversal | Diabetes remission or reversal | 3 | Shamanna et al [], Shamanna et al [], Surian et al [] |
| Improved detection or screening | Higher detection rates, classification accuracy | 2 | Mishra et al [], Vaskovsky et al [] |
| Blood pressure outcomes | Hypertension remission, reduced SBP/DBP | 2 | Shamanna et al [], Shamanna et al [] |
| Early detection or decision support | Improved early intervention | 1 | Vaskovsky et al [] |
| Enhanced patient engagement | Improved patient comprehension and engagement | 1 | Sarani Rad et al [] |
| Patient or clinician satisfaction | High clinician satisfaction | 1 | Zhu et al [] |
| Personalized therapy optimization | Enhanced insulin dosing precision | 1 | Ahmadasas et al [] |
HbA1c: hemoglobin A1c.
GDM: gestational diabetes mellitus.
RMSE: root mean square error.
MAE: mean absolute error.
T2D: type 2 diabetes.
SBP: systolic blood pressure.
DBP: diastolic blood pressure.
Key findings included:
- Improved HbA1c or glycemic control was the most frequently reported outcome, appearing in 17 (60.7%) studies and showing HbA1c reduction, increased TIR, and reduced variability [,-,-,,,-,,,].
- Other clinical benefits were described in 11 (39.3%) studies, including retinopathy or nephropathy improvement and cardiovascular risk reduction [,-,,,,,,,].
- Improved prediction accuracy was reported in 9 (32.1%) studies, with accurate glucose or gestational diabetes mellitus prediction and low root-mean-square error (RMSE) and mean absolute error (MAE) [,,,,,,,,].
- Less frequently, outcomes included medication use reduction, weight or metabolic improvements, hypo- or hyperglycemia reduction, and other patient-centered measures.
Reported quantitative outcomes suggest that some DT applications were associated with clinically meaningful improvements, although results varied by study design and use case. In 1 retrospective T2D cohort, HbA1c decreased from 8.8% to 6.9% after 90 days, corresponding to a 1.9 percentage-point reduction, together with a 56.9% reduction in homeostatic model assessment of insulin resistance, a 6.1% decrease in body weight, and 89.1% (57/64) of participants achieving time in range (70‐180 mg/dL) ≥70% after the intervention []. In a DT-based exercise decision support system for T1D, mean time in range improved from 80.2% to 92.3% for aerobic exercise and from 72.3% to 87.3% for resistance exercise, while time spent in low glucose decreased from 15.1% to 5.1% and from 18.2% to 6.6%, respectively []. A mechanistic personalized nutrition model in prediabetes predicted individual body weight and HbA1c trajectories with mean prediction errors of 0.7 kg and 0.08 percentage points in the training dataset, and approximately 1.1% and 1.4% percentage errors, respectively, in the test dataset []. Some prediction-focused systems also reported strong performance metrics, including RMSE 24.96 mg/dL, MAE 17.21 mg/dL, and area under the receiver operating characteristic curve >0.85 for postprandial glucose prediction, as well as area under the curve (AUC) of 0.80‐0.82 for chronic kidney disease identification and AUC 0.86 for 3-year chronic kidney disease prediction in T2D cohorts [,]. In maternal-risk applications, 1 DT system reported 83.5% accuracy for maternal health risk assessment and 97.2% precision for gestational diabetes prediction [].
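Several of these outcomes rest on simple arithmetic over glucose readings. The sketch below illustrates, under stated assumptions, how time in range (70‐180 mg/dL) can be computed from a series of readings and how a percentage-point change in HbA1c differs from a relative change. The function names are hypothetical, and the calculation assumes evenly spaced readings so that each one represents an equal share of time.

```python
from typing import Sequence


def time_in_range(readings_mg_dl: Sequence[float],
                  low: float = 70.0, high: float = 180.0) -> float:
    """Percentage of readings within [low, high] mg/dL.

    Assumes evenly spaced readings, so each one counts for equal time.
    """
    if not readings_mg_dl:
        raise ValueError("at least one reading is required")
    in_range = sum(1 for g in readings_mg_dl if low <= g <= high)
    return 100.0 * in_range / len(readings_mg_dl)


def meets_tir_target(readings_mg_dl: Sequence[float],
                     target_pct: float = 70.0) -> bool:
    """True when time in range reaches the target (>=70% by default)."""
    return time_in_range(readings_mg_dl) >= target_pct


# HbA1c values reported for the retrospective T2D cohort.
hba1c_before, hba1c_after = 8.8, 6.9
point_change = round(hba1c_after - hba1c_before, 1)            # percentage points
relative_change = round(100 * point_change / hba1c_before, 1)  # percent of baseline

# Share of participants reaching the TIR target, from the reported 57 of 64.
share_meeting_target = round(100 * 57 / 64, 1)

print(point_change, relative_change, share_meeting_target)
```

The distinction matters when comparing studies: the same change from 8.8% to 6.9% is a 1.9 percentage-point reduction but roughly a 21.6% reduction relative to baseline.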
Validation Methods (RQ10)
Validation approaches were grouped into 6 broad categories, reflecting how DT systems were evaluated for performance, safety, and generalizability. The table below summarizes the validation methods used in diabetes DT systems.
| Validation method category | Key characteristics | Studies, n | Representative examples |
| Quantitative evaluation | Accuracy metrics (eg, RMSE and AUC), statistical tests, cross-validation | 21 | Vaskovsky and Chvanova [], Mishra et al [], Leszczełowska et al [], Vaskovsky et al [] |
| Retrospective validation | Cross-validation, train or test split, retrospective data analysis | 10 | Thamotharan et al [], Joshi et al [], Villa-Tamayo et al [], Batagov et al [] |
| Simulation testing | ReplayBG or UVa/Padova simulation, virtual cohort evaluation | 9 | Young et al [], Wang et al [], Pellizzari et al [], Chen et al [] |
| Clinical trials | Randomized controlled trial, pilot study, prospective design | 4 | Shamanna et al [], Shamanna et al [], Zhu et al [], Cappon et al [] |
| Real-world validation | Clinical evaluation, patient outcomes, CGM tracking | 4 | Shamanna et al [], Colmegna et al [], Zhu et al [], Surian et al [] |
| Expert review | Case study evaluation, user feedback | 2 | Shamanna et al [], Zhu et al [] |
RMSE: root-mean-square error.
AUC: area under the curve.
CGM: continuous glucose monitoring.
Key findings included:
- Quantitative evaluation was the most common approach, appearing in 21 (75%) studies and typically using accuracy metrics (eg, RMSE, MAE, and AUC) and cross-validation methods to assess performance [-,-,,,-,,].
- Retrospective validation was applied in 10 (35.7%) studies, using historical datasets (eg, EHRs and CGM logs) for training or testing and retrospective analysis [,,,,,,,,,].
- Simulation testing was reported in 9 (32.1%) studies, often leveraging tools such as the UVa/Padova simulator or ReplayBG to validate insulin control and metabolic models [,,,-,,].
- Clinical and real-world evaluation was limited: clinical trials were reported in 4 (14.3%) studies [,,,] and real-world validation in 4 (14.3%) studies [,,,], including small-scale pilots, randomized controlled trials, or deployment in real patient settings with CGM tracking.
- Expert review was rarely used, reported in 2 (7.1%) studies, based on clinician or user feedback or case study evaluation [,].
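The accuracy metrics that dominate quantitative evaluation are straightforward to compute. The sketch below gives standard definitions of RMSE and MAE for glucose predictions; the function names and the sample values are illustrative and are not taken from any included study.

```python
import math
from typing import Sequence


def _errors(actual: Sequence[float], predicted: Sequence[float]):
    """Return prediction errors after checking the inputs are comparable."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return [p - a for a, p in zip(actual, predicted)]


def rmse(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Root-mean-square error: penalizes large misses more heavily."""
    errs = _errors(actual, predicted)
    return math.sqrt(sum(e * e for e in errs) / len(errs))


def mae(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute error: average size of a miss, in the original units."""
    errs = _errors(actual, predicted)
    return sum(abs(e) for e in errs) / len(errs)


# Made-up glucose values in mg/dL, for illustration only.
measured = [100.0, 150.0, 200.0]
forecast = [110.0, 140.0, 200.0]
print(round(rmse(measured, forecast), 2), round(mae(measured, forecast), 2))
```

RMSE is never smaller than MAE on the same data, and a wide gap between them points to a few large errors, which is relevant when the cost of a missed hypoglycemic excursion is high.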
Implementation and Governance (RQ11, RQ12)
Overview
This section describes how ethical, legal, and practical considerations are addressed in the implementation of DT systems for diabetes. It summarizes reported privacy and regulatory strategies (RQ11) and examines technical and workflow-related barriers to deployment (RQ12). Together, these questions assess readiness for safe, responsible, and scalable clinical integration.
Privacy, Ethical, and Regulatory Considerations (RQ11)
DT systems introduce complex ethical and legal considerations due to their reliance on sensitive health data and AI-driven decision-making. Among the 28 studies [,-,-], 4 high-level categories were identified: data privacy, consent and transparency, accountability, and bias or fairness. The table below summarizes the strategies used for handling privacy, ethical, and regulatory issues in diabetes DT systems.
| Ethics or privacy category | Key characteristics | Studies, n | Representative examples |
| Data privacy | Data anonymization, encryption, GDPR or HIPAA compliance | 8 | Mishra et al [], Zhu et al [], Vaskovsky et al [], Chahal et al [] |
| Accountability | Audit trails, regulatory compliance, and interoperability | 6 | Cappon et al [], Zhu et al [], Vaskovsky et al [], Chahal et al [] |
| Consent and transparency | Data ownership, ethics approval obtained, informed consent, patient consent, permission-based data storage | 6 | Zhu et al [], Vaskovsky et al [], Chahal et al [], Cappon et al [] |
| Bias and fairness | Identification of bias potential | 1 | Sarani Rad et al [] |
GDPR: General Data Protection Regulation.
HIPAA: Health Insurance Portability and Accountability Act.
Key findings included:
- Data privacy was the most frequently discussed consideration, appearing in 8 (28.6%) studies, typically through anonymization, encryption, and compliance with HIPAA (Health Insurance Portability and Accountability Act) or GDPR (General Data Protection Regulation) [,,,,-].
- Accountability appeared in 6 (21.4%) studies, including the use of audit trails, traceability, and regulatory compliance mechanisms [-,-].
- Consent and transparency were also reported in 6 (21.4%) studies, covering informed consent procedures, institutional review board approvals, and patient-facing disclosures [,,-].
- Bias and fairness were noted in only 1 (3.6%) study, reflecting a critical and underexplored gap in addressing algorithmic inequity [].
Implementation Barriers and Enablers (RQ12)
Although many DT systems demonstrated technical feasibility, real-world implementation remains constrained by several recurring challenges. These were grouped into 4 main categories: data quality and availability, model limitations, validation limitations, and workflow or interoperability barriers. The table below summarizes the implementation barriers reported for diabetes DT systems.
| Implementation barriers category | Key characteristics | Studies, n | Representative examples |
| Validation limitation | Lack of randomization, short follow-up, and personalization gaps | 16 | Wang et al [], Zavitsanou et al [], Villa-Tamayo et al [], Shamanna et al [] |
| Data quality or availability | Burden of data collection, missing variables, limited real-world data, and synthetic datasets | 14 | Vaskovsky and Chvanova [], Wang et al [], Shamanna et al [], Shamanna et al [] |
| Model limitations | Simplified physiology, tuning complexity, and selection bias | 11 | Ahmadasas et al [], Wang et al [], Leszczełowska et al [], Villa-Tamayo et al [] |
| Workflow and interoperability | Clinical workflow alignment, data format compatibility issues, data integration challenges, data integration from multiple sources, integration with existing devices, and interoperability challenges | 8 | Cappon et al [], Colmegna et al [], Mishra et al [], Zhu et al [] |
Key findings included:
- Validation limitations were the most common barrier, appearing in 16 (57.1%) studies and reflecting reliance on synthetic datasets, short follow-up durations, and lack of external clinical evaluation [,-,-,,,].
- Data quality and availability issues were reported in 14 (50%) studies, including missing data, unreliable sensors, and burdensome data collection procedures [,,,,,,,,-,,].
- Model limitations were described in 11 (39.3%) studies, such as limited personalization, oversimplified physiological modeling, or small training datasets [,,,,,,,,,,].
- Workflow and interoperability barriers appeared in 8 (28.6%) studies, emphasizing difficulties integrating DTs into clinical workflows, EHR systems, and device ecosystems [,,,,-].
Research and Development Gaps (RQ13)
Although DT systems for diabetes are showing technical feasibility, multiple areas require further investigation and refinement. From the 28 reviewed studies [,-,-], 8 major gap categories emerged: (1) need for clinical validation, (2) limited scope of application, (3) integration challenges, (4) usability and real-world adoption, (5) lack of longitudinal data, (6) data quality and availability, (7) methodological limitations, and (8) scalability challenges. The table below summarizes the reported research and development gaps in diabetes DT systems.
| Gap category | Key characteristics | Studies, n | Representative examples |
| Need for clinical validation | Larger clinical trials and subgroup and demographic validation | 15 | Shamanna et al [], Thamotharan et al [], Shamanna et al [], Zhu et al [] |
| Limited scope of application | Broader populations, diverse settings, and multimorbidity expansion | 14 | Sarani Rad et al [], Shamanna et al [], Cappon et al [], Thamotharan et al [], Zhang et al [] |
| Integration challenges | Integration with EHRs, real-time systems, and closed-loop models | 11 | Thamotharan et al [], Joshi et al [], Zhu et al [], Batagov et al [] |
| Usability and real-world adoption | Personalization for MDI users, real-world evaluation, and broader adoption | 11 | Ahmadasas et al [], Wang et al [], Vaskovsky et al [], Chahal et al [] |
| Lack of longitudinal data | Long-term outcome tracking, sustainability, and effectiveness studies | 8 | Shamanna et al [], Shamanna et al [], Cappon et al [], Surian et al [] |
| Data quality and availability | Dependence on wearable devices and data quality, expansion to broader population data, expansion to larger datasets, limitations in meal tracking and calibration, and need for denser time-series data | 6 | Vaskovsky and Chvanova [], Wang et al [], Mishra et al [], Villa-Tamayo et al [] |
| Methodological limitations | Standardized protocols, adaptive learning, and causal reasoning | 5 | Sarani Rad et al [], Vaskovsky and Chvanova [], Colmegna et al [], Pellizzari et al [] |
| Scalability challenges | Deployment in low-resource settings and real-world scalability | 3 | Leszczełowska et al [], Zhu et al [], Chahal et al [] |
EHR: electronic health record.
MDI: multiple daily injection.
Key findings included:
- Need for clinical validation was the most frequently cited gap, appearing in 15 (53.6%) studies and reflecting the lack of randomized trials, subgroup evaluations, and real-world testing [,,-,,,,,,,,,].
- Limited scope of application was reported in 14 (50%) studies, with DTs often targeting narrow use cases and failing to generalize across diverse populations or multimorbidity contexts [,-,,-,-,].
- Integration challenges were noted in 11 (39.3%) studies, underscoring difficulties with EHR interoperability, real-time deployment, and multidevice environments [,,,,,,,-].
- Usability and real-world adoption also appeared in 11 (39.3%) studies, pointing to the need for personalization, support for multiple daily injection users, and strategies for broader adoption in routine care [,,,,,,,,-].
Discussion
This PRISMA-ScR–compliant scoping review maps the current state of DT systems in diabetes, addressing 13 structured research questions across 7 thematic domains.
System Design and Modeling Foundations (RQ1, RQ2, RQ3)
DT systems for diabetes use a wide range of modeling techniques, most commonly ML (eg, long short-term memory, gradient boosting, and reinforcement learning) and physiological simulation. Simulation engines and predictive ML modules were often integrated into layered architectures that also included personalization modules, decision support, and user-facing dashboards. Statistical and probabilistic methods (eg, regression and Bayesian inference) were also used in several studies, although less prominently. Few systems incorporated mechanistic control theory or signal-processing models. The inclusion of key components, such as simulation engines, control-feedback modules, and data integration pipelines, reflects a growing maturity in system design.
Target Conditions and Use Context (RQ4, RQ5)
Most DTs targeted T1D or T2D, with limited applications in gestational diabetes or diabetes-related complications, such as retinopathy. Primary clinical goals included glycemic prediction, insulin-dose optimization, lifestyle guidance, and therapeutic planning. Several systems also addressed the diagnosis of complications or risk stratification for comorbidities. The breadth of clinical use cases suggests that DTs are evolving from simple simulators into multifunctional clinical-support tools.
Data Sources and Personalization Mechanisms (RQ6, RQ7)
Lifestyle data, wearable devices, and CGM were the dominant inputs, with hybrid combinations being common. EHRs (12 studies) and synthetic or public datasets (6 studies) were used less often, providing historical or simulated information. Personalization was achieved through mechanisms such as real-time adaptation, individual model tuning, behavior-driven feedback (eg, nudges), and insulin titration. However, persistent challenges remain in data quality, sensor integration, and dataset heterogeneity.
Intelligence and Adaptability (RQ8)
Managing uncertainty and real-time updates is crucial for clinical reliability. Studies implemented adaptive learning, feedback loops, and explainable-AI methods (eg, attention mechanisms and knowledge graphs) to improve transparency and adaptability. Real-time CGM synchronization and, in some cases, human-in-the-loop oversight were used to enhance model responsiveness and safety.
Evaluation and Validation (RQ9, RQ10)
Quantitative validation (eg, RMSE and AUC) was common, but real-world clinical trials were rare. Most studies validated systems via retrospective datasets or simulations. Reported clinical outcomes included improved TIR, fewer hypoglycemic events, and, in some cases, T2D remission. However, evidence on long-term effectiveness, generalizability, and cost-effectiveness remains limited.
A notable finding across the included studies is the mismatch between technical sophistication and clinical maturity. Although many DT systems incorporated adaptive learning, individualized simulation, and multimodal data integration, most were evaluated using retrospective datasets or in silico simulations rather than prospective clinical deployment. This likely reflects the high implementation burden of DTs in diabetes, including the need for reliable real-time data streams, safety safeguards, interoperability with devices and clinical systems, and acceptable workflow integration. It also reflects the regulatory complexity of systems that may influence insulin dosing or therapeutic decision-making.
Implementation and Governance (RQ11, RQ12)
Privacy and ethical considerations were addressed inconsistently, often limited to brief compliance mentions (eg, GDPR and HIPAA). A smaller subset of studies explicitly discussed accountability (eg, audit trails and governance mechanisms) or algorithmic bias and fairness, highlighting underexplored areas of governance. Implementation enablers included real-time feedback and sensor integration, whereas barriers included poor data quality, system complexity, lack of clinical workflow alignment, and limited scalability.
Another important finding is the limited attention to algorithmic bias and fairness. Despite the increasing use of AI-driven modeling and decision-support approaches, only a small subset of studies explicitly discussed bias, representativeness, or equity-related concerns. This suggests that the field is still focused primarily on technical feasibility and predictive performance rather than equitable deployment across diverse patient populations.
Research and Development Gaps (RQ13)
Key gaps include limited clinical validation, insufficient longitudinal data, a lack of standardized model architectures, and limited generalizability to diverse populations. Many studies emphasized the need for integration with EHRs, real-world testing, and regulatory alignment. Addressing these gaps will be essential to enable scalable, equitable, and clinically robust DT systems for diabetes management.
Summary and Implications
This review offers a panoramic view of the evolving DT landscape in diabetes. While notable technical advances are evident—particularly in data integration and personalization—the field remains formative, with substantial work needed in clinical validation, ethical governance, and system interoperability. Future research should emphasize not only algorithmic sophistication but also real-world applicability, safety, and equity to support the scalable and responsible deployment of DTs in diabetes care.
Taken together, the literature suggests that DT research in diabetes is progressing from conceptual and simulation-based work toward more clinically relevant systems, but the field remains early in real-world maturity. Future studies should prioritize prospective validation, broader demographic and clinical representation, transparent reporting, interoperability with routine care systems, and governance frameworks that address privacy, accountability, and fairness.
Limitations
This scoping review has several limitations. First, only English-language studies with accessible full text were included, and gray literature was excluded, which may have led to the omission of some relevant studies. Second, formal risk-of-bias and certainty-of-evidence assessments were not performed because the aim was to map a heterogeneous body of literature rather than evaluate intervention effects. Third, the included studies differed substantially in design, terminology, validation methods, and outcomes, limiting direct comparison. Finally, many studies were early-phase, retrospective, or simulation-based, which limits conclusions about clinical effectiveness and real-world implementation.
Acknowledgments
Generative artificial intelligence was used to assist with language editing and manuscript drafting. All AI-assisted output was reviewed, edited, and verified by the authors, who take full responsibility for the final content of the manuscript.
Funding
This work was supported by the National Science Foundation under grants OIA-2218046 and OIA-2611071.
Data Availability
All data analyzed in this scoping review were charted from publicly available publications. The search strategy, screening criteria, extracted study characteristics, and supplementary review materials are provided in the manuscript and its supplementary files. No primary participant-level dataset was generated for this study.
Authors' Contributions
FSR, EB, and MJ contributed to conceptualization. FSR, KK, and MJ contributed to data curation. FSR and EB contributed to formal analysis and methodology. JL contributed to supervision. FSR, KK, and MJ wrote the original draft. JL reviewed and edited the manuscript. All authors read and approved the final manuscript.
Conflicts of Interest
None declared.
References
- Emmert-Streib F. Defining a digital twin: a data science-based unification. MAKE. 2023;5(3):1036-1054. [CrossRef]
- Bruynseels K, Santoni de Sio F, van den Hoven J. Digital twins in health care: ethical implications of an emerging engineering paradigm. Front Genet. 2018;9:31. [CrossRef] [Medline]
- Sarani Rad F, Hendawi R, Yang X, Li J. Personalized diabetes management with digital twins: a patient-centric knowledge graph approach. J Pers Med. Mar 28, 2024;14(4):359. [CrossRef] [Medline]
- American Diabetes Association. Standards of medical care in diabetes. Diabetes Care. Jan 2005;28 Suppl 1(Suppl 1):S4-S36. [CrossRef] [Medline]
- American Diabetes Association. 6. Glycemic Targets: Standards of Medical Care in Diabetes-2021. Diabetes Care. Jan 2021;44(Suppl 1):S73-S84. [CrossRef] [Medline]
- Sun T, He X, Li Z. Digital twin in healthcare: recent updates and challenges. Digit Health. 2023;9:20552076221149651. [CrossRef] [Medline]
- Zhang Y, Qin G, Aguilar B, et al. A framework towards digital twins for type 2 diabetes. Front Digit Health. 2024;6:1336050. [CrossRef] [Medline]
- Shamanna P, Joshi S, Thajudeen M, et al. Personalized nutrition in type 2 diabetes remission: application of digital twin technology for predictive glycemic control. Front Endocrinol (Lausanne). 2024;15:1485464. [CrossRef] [Medline]
- Cappon G, Vettoretti M, Sparacino G, Favero SD, Facchinetti A. ReplayBG: a digital twin-based methodology to identify a personalized model from type 1 diabetes data and simulate glucose concentrations to assess alternative therapies. IEEE Trans Biomed Eng. Nov 2023;70(11):3227-3238. [CrossRef] [Medline]
- Ahmadasas M, Rashid MM, Siket M, Abdel-Latif MM, Shahidehpour A, Cinar A. Personalized artificial pancreas for glucose regulation in people with diabetes. IFAC-PapersOnLine. 2024;58(30):55-60. [CrossRef]
- Joshi S, Shamanna P, Dharmalingam M, et al. Digital twin-enabled personalized nutrition improves metabolic dysfunction-associated fatty liver disease in type 2 diabetes: results of a 1-year randomized controlled study. Endocr Pract. Dec 2023;29(12):960-970. [CrossRef] [Medline]
- Batagov A, Dalan R, Wu A, Lai W, Tan CS, Eisenhaber F. Generalized metabolic flux analysis framework provides mechanism-based predictions of ophthalmic complications in type 2 diabetes patients. Health Inf Sci Syst. Dec 2023;11(1):18. [CrossRef] [Medline]
- Surian NU, Batagov A, Wu A, et al. A digital twin model incorporating generalized metabolic fluxes to identify and predict chronic kidney disease in type 2 diabetes mellitus. NPJ Digit Med. May 24, 2024;7(1):140. [CrossRef] [Medline]
- Shamanna P, Erukulapati RS, Shukla A, et al. One-year outcomes of a digital twin intervention for type 2 diabetes: a retrospective real-world study. Sci Rep. Oct 26, 2024;14(1):25478. [CrossRef] [Medline]
- Shamanna P, Joshi S, Shah L, et al. Type 2 diabetes reversal with digital twin technology-enabled precision nutrition and staging of reversal: a retrospective cohort study. Clin Diabetes Endocrinol. Nov 15, 2021;7(1):21. [CrossRef] [Medline]
- Shamanna P, Joshi S, Dharmalingam M, et al. Digital twin in managing hypertension among people with type 2 diabetes: 1-year randomized controlled trial. JACC Adv. Sep 2024;3(9):101172. [CrossRef] [Medline]
- Shamanna P, Dharmalingam M, Sahay R, et al. Retrospective study of glycemic variability, BMI, and blood pressure in diabetes patients in the Digital Twin Precision Treatment Program. Sci Rep. Jul 21, 2021;11(1):14892. [CrossRef] [Medline]
- Shamanna P, Saboo B, Damodharan S, et al. Reducing HbA1c in type 2 diabetes using digital twin technology-enabled precision nutrition: a retrospective analysis. Diabetes Ther. Nov 2020;11(11):2703-2714. [CrossRef] [Medline]
- Tricco AC, Lillie E, Zarin W, et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 2, 2018;169(7):467-473. [CrossRef] [Medline]
- Peters MDJ, Marnie C, Tricco AC, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. Oct 2020;18(10):2119-2126. [CrossRef] [Medline]
- Jafarpour M, Bitaraf E, Moeini A, Nahvijou A. Triple A (AAA): a tool to analyze scientific literature metadata with complex network parameters. Presented at: 2023 9th International Conference on Web Research (ICWR); May 3-4, 2023:342-345; Tehran, Iran. [CrossRef]
- Vaskovsky AM, Chvanova MS. Designing the neural network for personalization of food products for persons with genetic president of diabetic sugar. Presented at: 2019 3rd School on Dynamics of Complex Networks and their Application in Intellectual Robotics (DCNAIR); Sep 9-11, 2019:175-177; Innopolis, Russia. [CrossRef]
- Colmegna P, Wang K, Garcia-Tirado J, Breton MD. Mapping data to virtual patients in type 1 diabetes. Control Eng Pract. Oct 2020;103:104605. [CrossRef]
- Young G, Dodier R, Youssef JE, et al. Design and in silico evaluation of an exercise decision support system using digital twin models. J Diabetes Sci Technol. Mar 2024;18(2):324-334. [CrossRef] [Medline]
- Thamotharan P, Srinivasan S, Kesavadev J, et al. Human digital twin for personalized elderly type 2 diabetes management. J Clin Med. Mar 7, 2023;12(6):2094. [CrossRef] [Medline]
- Wang Q, Molenaar P, Harsh S, et al. Personalized state-space modeling of glucose dynamics for type 1 diabetes using continuously monitored glucose, insulin dose, and meal intake: an extended Kalman filter approach. J Diabetes Sci Technol. Mar 2014;8(2):331-345. [CrossRef] [Medline]
- Zavitsanou S, Mantalaris A, Georgiadis MC, Pistikopoulos EN. In silico closed-loop control validation studies for optimal insulin delivery in type 1 diabetes. IEEE Trans Biomed Eng. Oct 2015;62(10):2369-2378. [CrossRef] [Medline]
- Mishra V, Koul S, Taylor IW. Digital twin for diabetes management using system dynamics simulation: the case of India. Presented at: International Conference on Computational Intelligence in Communications and Business Analytics; Jan 24-26, 2024:305-313; Patna, India. [CrossRef]
- Pellizzari E, Prendin F, Cappon G, Sparacino G, Facchinetti A. drCORRECT: an algorithm for the preventive administration of postprandial corrective insulin boluses in type 1 diabetes management. J Diabetes Sci Technol. May 2025;19(3):711-721. [CrossRef] [Medline]
- Chen JH, Fukasawa M, Sakane N, et al. Optimization of nutritional strategies using a mechanistic computational model in prediabetes: application to the J-DOIT1 study data. PLoS ONE. 2023;18(11):e0287069. [CrossRef] [Medline]
- Leszczełowska P, Mazur-Milecka M, Kowalczyk N, Sobotka M. Maternal health risk assessment using digital twin application. Presented at: 2024 16th International Conference on Human System Interaction (HSI); Jul 8-11, 2024. [CrossRef]
- Villa-Tamayo MF, Pavan J, Breton M. Analysis on the practical identifiability of the subcutaneous oral glucose minimal model. IFAC-PapersOnLine. 2024;58(24):269-274. [CrossRef]
- Zhu T, Li K, Herrero P, Georgiou P. GluGAN: generating personalized glucose time series using generative adversarial networks. IEEE J Biomed Health Inform. Oct 2023;27(10):5122-5133. [CrossRef] [Medline]
- Vaskovsky AM, Chvanova MS, Rebezov MB. Creation of digital twins of neural network technology of personalization of food products for diabetics. Presented at: 2020 4th Scientific School on Dynamics of Complex Networks and their Application in Intellectual Robotics (DCNAIR); Sep 7-9, 2020:251-253; Innopolis, Russia. [CrossRef]
- Chahal Y, Tokas R, Sharma K. Smart solution using digital twin and IoT for diabetic retinopathy. Presented at: 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT); Jul 6-8, 2023. [CrossRef]
- Cappon G, Pellizzari E, Cossu L, et al. System architecture of TWIN: a new digital TWIN-based clinical decision support system for type 1 diabetes management in children. Presented at: 2023 IEEE 19th International Conference on Body Sensor Networks (BSN); Oct 9-11, 2023. [CrossRef]
- Rad FS, Jafarpour M, Bitaraf E, Khaleghdadi K, Li J. Digital twin applications in diabetes management: scoping review. Open Science Framework. URL: https://osf.io/n49xz/ [Accessed 2026-06-02]
Abbreviations
- AI: artificial intelligence
- AUC: area under the curve
- CGM: continuous glucose monitoring
- DT: digital twin
- EHR: electronic health record
- GDPR: General Data Protection Regulation
- HbA1c: hemoglobin A1c
- HIPAA: Health Insurance Portability and Accountability Act
- MAE: mean absolute error
- ML: machine learning
- NHANES: National Health and Nutrition Examination Survey
- OSF: Open Science Framework
- PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses
- PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews
- RMSE: root-mean-square error
- RQ: research question
- T1D: type 1 diabetes
- T2D: type 2 diabetes
- TIR: time-in-range
Edited by Ivan Steenstra; submitted 27.Aug.2025; peer-reviewed by Marzieh Soheili, Stuart Nelson; final revised version received 17.Apr.2026; accepted 24.Apr.2026; published 18.Jun.2026.
Copyright © Fatemeh Sarani Rad, Maryam Jafarpour, Ehsan Bitaraf, Katayoon Khaleghdadi, Juan Li. Originally published in JMIR Diabetes (https://diabetes.jmir.org), 18.Jun.2026.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Diabetes, is properly cited. The complete bibliographic information, a link to the original publication on https://diabetes.jmir.org/, as well as this copyright and license information must be included.

